17 resultados para Recent advances in Molecular Biology

em Helda - Digital Repository of University of Helsinki


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Scattering of X-rays and neutrons has been applied to the study of nanostructures with interesting biological functions. The systems studied were the protein calmodulin and its complexes, bacterial virus bacteriophage phi6, and the photosynthetic antenna complex from green sulfur bacteria, chlorosome. Information gathered using various structure determination methods has been combined to the low resolution information obtained from solution scattering. Conformational changes in calmodulin-ligand complex were studied by combining the directional information obtained from residual dipole couplings in nuclear magnetic resonance to the size information obtained from small-angle X-ray scattering from solution. The locations of non-structural protein components in a model of bacteriophage phi6, based mainly on electron microscopy, were determined by neutron scattering, deuterium labeling and contrast variation. New data are presented on the structure of the photosynthetic antenna complex of green sulfur bacteria and filamentous anoxygenic phototrophs, also known as the chlorosome. The X-ray scattering and electron cryomicroscopy results from this system are interpreted in the context of a new structural model detailed in the third paper of this dissertation. The model is found to be consistent with the results obtained from various chlorosome containing bacteria. The effect of carotenoid synthesis on the chlorosome structure and self-assembly are studied by carotenoid extraction, biosynthesis inhibition and genetic manipulation of the enzymes involved in carotenoid biosynthesis. Carotenoid composition and content are found to have a marked effect on the structural parameters and morphology of chlorosomes.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: The Ewing sarcoma family of tumors (ESFT) are rare but highly malignant neoplasms that occur mainly in bone or but also in soft tissue. ESFT affects patients typically in their second decade of life, whereby children and adolescents bear the heaviest incidence burden. Despite recent advances in the clinical management of ESFT patients, their prognosis and survival are still disappointingly poor, especially in cases with metastasis. No targeted therapy for ESFT patients is currently available. Moreover, based merely on current clinical and biological characteristics, accurate classification of ESFT patients often fails at the time of diagnosis. Therefore, there is a constant need for novel molecular biomarkers to be applied in tandem with conventional parameters to further intensify ESFT risk-stratification and treatment selection, and ultimately to develop novel targeted therapies. In this context, a greater understanding of the genetics and immune characteristics of ESFT is needed. Aims: This study sought to open novel insights into gene copy number changes and gene expression in ESFT and, further, to enlighten the role of inflammation in ESFT. For this purpose, microarrays were used to provide gene-level information on a genomewide scale. In addition, this study focused on screening of 9p21.3 deletion sizes and frequencies in ESFT and, in another pediatric cancer, acute lymphocytic leukemia (ALL), in order to define more exact criteria for highrisk patient selection and to provide data for developing a more reliable diagnostic method to detect CDKN2A deletions. Results: In study I, 20 novel ESFT-associated suppressor genes and oncogenes were pinpointed using combined array CGH and expression analysis. In addition, interesting chromosomal rearrangements were identified: (1) Duplication of derivative chromosome der(22)(11;22) was detected in three ESFT patients. This duplication included the EWSR1-FLI1 fusion gene leading to increase in its copy number; (2) Cryptic amplifications on chromosomes 20 and 22 were detected, suggesting a novel translocation between chromosomes 20 and 22, which most probably produces a fusion between EWSR1 and NFATC2. In study II, bioinformatic analysis of ESFT expression profiles showed that inflammatory gene activation is detectable in ESFT patient samples and that the activation is characterized by macrophage gene expression. Most interestingly, ESFT patient samples were shown to express certain inflammatory genes that were prognostically significant. High local expression of C5 and JAK1 at the tumor site was shown to associate with favorable clinical outcome, whereas high local expression of IL8 was shown to be detrimental. Studies III and IV showed that the smallest overlapping region of deletion in 9p21.3 includes CDKN2A in all cases and that the length of this region is 12.2 kb in both Ewing sarcoma and ALL. Furthermore, our results showed that the most widely used commercial CDKN2A FISH probe creates false negative results in the narrowest microdeletion cases (<190 kb). Therefore, more accurate methods should be developed for the detection of deletions in the CDKN2A locus. Conclusions: This study provides novel insights into the genetic changes involved in the biology of ESFT, in the interaction between ESFT cells and immune system, and in the inactivation of CDKN2A. Novel ESFT biomarker genes identified in this study serve as a useful resource for future studies and in developing novel therapeutic strategies to improve the survival of patients with ESFT.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Staphylococcus aureus is one of the most important bacteria that cause disease in humans, and methicillin-resistant S. aureus (MRSA) has become the most commonly identified antibiotic-resistant pathogen in many parts of the world. MRSA rates have been stable for many years in the Nordic countries and the Netherlands with a low MRSA prevalence in Europe, but in the recent decades, MRSA rates have increased in those low-prevalence countries as well. MRSA has been established as a major hospital pathogen, but has also been found increasingly in long-term facilities (LTF) and in communities of persons with no connections to the health-care setting. In Finland, the annual number of MRSA isolates reported to the National Infectious Disease Register (NIDR) has constantly increased, especially outside the Helsinki metropolitan area. Molecular typing has revealed numerous outbreak strains of MRSA, some of which have previously been associated with community acquisition. In this work, data on MRSA cases notified to the NIDR and on MRSA strain types identified with pulsed-field gel electrophoresis (PFGE), multilocus sequence typing (MLST), and staphylococcal cassette chromosome mec (SCCmec) typing at the National Reference Laboratory (NRL) in Finland from 1997 to 2004 were analyzed. An increasing trend in MRSA incidence in Finland from 1997 to 2004 was shown. In addition, non-multi-drug resistant (NMDR) MRSA isolates, especially those resistant only to methicillin/oxacillin, showed an emerging trend. The predominant MRSA strains changed over time and place, but two internationally spread epidemic strains of MRSA, FIN-16 and FIN-21, were related to the increase detected most recently. Those strains were also one cause of the strikingly increasing invasive MRSA findings. The rise of MRSA strains with SCCmec types IV or V, possible community-acquired MRSA was also detected. With questionnaires, the diagnostic methods used for MRSA identification in Finnish microbiology laboratories and the number of MRSA screening specimens studied were reviewed. Surveys, which focused on the MRSA situation in long-term facilities in 2001 and on the background information of MRSA-positive persons in 2001-2003, were also carried out. The rates of MRSA and screening practices varied widely across geographic regions. Part of the NMDR MRSA strains could remain undetected in some laboratories because of insufficient diagnostic techniques used. The increasing proportion of elderly population carrying MRSA suggests that MRSA is an emerging problem in Finnish long-term facilities. Among the patients, 50% of the specimens were taken on a clinical basis, 43% on a screening basis after exposure to MRSA, 3% on a screening basis because of hospital contact abroad, and 4% for other reasons. In response to an outbreak of MRSA possessing a new genotype that occurred in a health care ward and in an associated nursing home of a small municipality in Northern Finland in autumn 2003, a point-prevalence survey was performed six months later. In the same study, the molecular epidemiology of MRSA and methicillin-sensitive S. aureus (MSSA) strains were also assessed, the results to the national strain collection compared, and the difficulties of MRSA screening with low-level oxacillin-resistant isolates encountered. The original MRSA outbreak in LTF, which consisted of isolates possessing a nationally new PFGE profile (FIN-22) and internationally rare MLST type (ST-27), was confined. Another previously unrecognized MRSA strain was found with additional screening, possibly indicating that current routine MRSA screening methods may be insufficiently sensitive for strains possessing low-level oxacillin resistance. Most of the MSSA strains found were genotypically related to the epidemic MRSA strains, but only a few of them had received the SCCmec element, and all those strains possessed the new SCCmec type V. In the second largest nursing home in Finland, the colonization of S. aureus and MRSA, and the role of screening sites along with broth enrichment culture on the sensitivity to detect S. aureus were studied. Combining the use of enrichment broth and perineal swabbing, in addition to nostrils and skin lesions swabbing, may be an alternative for throat swabs in the nursing home setting, especially when residents are uncooperative. Finally, in order to evaluate adequate phenotypic and genotypic methods needed for reliable laboratory diagnostics of MRSA, oxacillin disk diffusion and MIC tests to the cefoxitin disk diffusion method at both +35°C and +30°C, both with or without an addition of sodium chloride (NaCl) to the Müller Hinton test medium, and in-house PCR to two commercial molecular methods (the GenoType® MRSA test and the EVIGENETM MRSA Detection test) with different bacterial species in addition to S. aureus were compared. The cefoxitin disk diffusion method was superior to that of oxacillin disk diffusion and to the MIC tests in predicting mecA-mediated resistance in S. aureus when incubating at +35°C with or without the addition of NaCl to the test medium. Both the Geno Type® MRSA and EVIGENETM MRSA Detection tests are usable, accurate, cost-effective, and sufficiently fast methods for rapid MRSA confirmation from a pure culture.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Progressive myoclonus epilepsy of Unverricht-Lundborg type (EPM1) is an autosomal recessively inherited disorder characterized by age of onset at 6-15 years, stimulus-sensitive myoclonus, tonic-clonic epileptic seizures and a progressive course. Mutations in the cystatin B (CSTB) gene underlie EPM1. The most common mutation underlying EPM1 is a dodecamer repeat expansion in the promoter region of CSTB. In addition, nine other mutations have been identified. CSTB, a cysteine protease inhibitor, is a ubiquitously expressed inhibitor of cathepsins, but its physiological function is unknown. The purpose of this study was to investigate CSTB gene expression and CSTB protein function in normal and pathological conditions. The basal CSTB promoter was mapped and characterized using different promoter-luciferase gene constructs. The binding activity of transcription factors to one ARE half, five Sp1 and four AP1 sites in the CSTB promoter was demonstrated. The CSTB promoter activity was clearly decreased using a CSTB promoter with "premutation" repeat expansions and in individuals with alike expansions. The expression of CSTB mRNA and protein was markedly reduced in patient cells. The endogenous CSTB protein localized to the nucleus, cytoplasm and lysosomes, and in differentiated cells merely to the cytoplasm. This suggests that the subcellular distribution of CSTB is dependent on the differentation status of the cells. The proteins representing patient missense mutations failed to associate with lysosomes, implying the importance of the lysosomal association for the proper physiological function of CSTB. Several alternatively spliced CSTB isoforms were identified. Of these CSTB2 was widely expressed with very low levels whereas the other alternatively spliced forms seemed to have limited tissue expression. In patients CSTB2 expression was reduced similarly to that of CSTB. The physiological relevance of CSTB alternative splicing remains unknown. The mouse Cstb transcript was shown to be present in all embryonic stages and adult tissues examined. The expression was highest at embryonic day 7 and in thymus, as well as in postnatal brain in the cortex, caudate putamen, thalamus, hippocampus, and in the Purkinje cell layer of the cerebellum. Our data implies that CSTB expression is tightly temporally and spatially regulated. The data presented in my thesis lay the basis for further understanding of the role of CSTB in health and disease.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Epilysin (MMP-28) is the most recently identified member of the matrix metalloproteinase (MMP) family of extracellular proteases. Together these enzymes are capable of degrading almost all components of the extracellular matrix (ECM) and are thus involved in important biological processes such as development, wound healing and immune functions, but also in pathological processes such as tumor invasion, metastasis and arthritis. MMPs do not act solely by degrading the ECM. They also regulate cell behavior by releasing growth factors and biologically active peptides from the ECM, by modulating cell surface receptors and adhesion molecules and by regulating the activity of many important mediators in inflammatory pathways. The aim of this study was to define the unique role of epilysin within the MMP-family, to elucidate how and when it is expressed and how its catalytic activity is regulated. To gain information on its essential functions and substrates, the specific aim was to characterize how epilysin affects the phenotype of epithelial cells, where it is biologically expressed. During the course of the study we found that the epilysin promoter contains a well conserved GT-box that is essential for the basic expression of this gene. Transcription factors Sp1 and Sp3 bind this sequence and could hence regulate both the basic and cell type and differentiation stage specific expression of epilysin. We cloned mouse epilysin cDNA and found that epilysin is well conserved between human and mouse genomes and that epilysin is glycosylated and activated by furin. Similarly to in human tissues, epilysin is normally expressed in a number of mouse tissues. The expression pattern differs from most other MMPs, which are expressed only in response to injury or inflammation and in pathological processes like cancer. These findings implicate that epilysin could be involved in tissue homeostasis, perhaps fine-tuning the phenotype of epithelial cells according to signals from the ECM. In view of these results, it was unexpected to find that epilysin can induce a stable epithelial to mesenchymal transition (EMT) when overexpressed in epithelial lung carcinoma cells. Transforming growth factor b (TGF-b) was recognized as a crucial mediator of this process, which was characterized by the loss of E-cadherin mediated cell-cell adhesion, elevated expression of gelatinase B and MT1-MMP and increased cell migration and invasion into collagen I gels. We also observed that epilysin is bound to the surface of epithelial cells and that this interaction is lost upon cell transformation and is susceptible to degradation by membrane type-1-MMP (MT1-MMP). The wide expression of epilysin under physiological conditions implicates that its effects on epithelial cell phenotype in vivo are not as dramatic as seen in our in vitro cell system. Nevertheless, current results indicate a possible interaction between epilysin and TGF-b also under physiological circumstances, where epilysin activity may not induce EMT but, instead, trigger less permanent changes in TGF-b signaling and cell motility. Epilysin may thus play an important role in TGF-b regulated events such as wound healing and inflammation, processes where involvement of epilysin has been indicated.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A repetitive sequence collection is one where portions of a base sequence of length n are repeated many times with small variations, forming a collection of total length N. Examples of such collections are version control data and genome sequences of individuals, where the differences can be expressed by lists of basic edit operations. Flexible and efficient data analysis on a such typically huge collection is plausible using suffix trees. However, suffix tree occupies O(N log N) bits, which very soon inhibits in-memory analyses. Recent advances in full-text self-indexing reduce the space of suffix tree to O(N log σ) bits, where σ is the alphabet size. In practice, the space reduction is more than 10-fold, for example on suffix tree of Human Genome. However, this reduction factor remains constant when more sequences are added to the collection. We develop a new family of self-indexes suited for the repetitive sequence collection setting. Their expected space requirement depends only on the length n of the base sequence and the number s of variations in its repeated copies. That is, the space reduction factor is no longer constant, but depends on N / n. We believe the structures developed in this work will provide a fundamental basis for storage and retrieval of individual genomes as they become available due to rapid progress in the sequencing technologies.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Mass spectrometry (MS) became a standard tool for identifying metabolites in biological tissues, and metabolomics is slowly acknowledged as a legitimate research discipline for characterizing biological conditions. The computational analyses of metabolomics, however, lag behind compared with the rapid advances in analytical aspects for two reasons. First is the lack of standardized data repository for mass spectra: each research institution is flooded with gigabytes of mass-spectral data from its own analytical groups and cannot host a world-class repository for mass spectra. The second reason is the lack of informatics experts that are fully experienced with spectral analyses. The two barriers must be overcome to establish a publicly free data server for MS analysis in metabolomics as does GenBank in genomics and UniProt in proteomics. The workshop brought together bioinformaticians working on mass spectral analyses in Finland and Japan with the goal to establish a consortium to freely exchange and publicize mass spectra of metabolites measured on various platforms computational tools to analyze spectra spectral knowledge that are computationally predicted from standardized data. This book contains the abstracts of the presentations given in the workshop. The programme of the workshop consisted of oral presentations from Japan and Finland, invited lectures from Steffen Neumann (Leibniz Institute of Plant Biochemistry), Matej Oresic (VTT), Merja Penttila (VTT) and Nicola Zamboni (ETH Zurich) as well as free form discussion among the participants. The event was funded by Academy of Finland (grants 139203 and 118653), Japan Society for the Promotion of Science (JSPS Japan-Finland Bilateral Semi- nar Program 2010) and Department of Computer Science University of Helsinki. We would like to thank all the people contributing to the technical pro- gramme and the sponsors for making the workshop possible. Helsinki, October 2010 Masanori Arita, Markus Heinonen and Juho Rousu

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The aim of the present study was to advance the methodology and use of time series analysis to quantify dynamic structures in psychophysiological processes and thereby to produce information on spontaneously coupled physiological responses and their behavioral and experiential correlates. Series of analyses using both simulated and empirical cardiac (IBI), electrodermal (EDA), and facial electromyographic (EMG) data indicated that, despite potential autocorrelated structures, smoothing increased the reliability of detecting response coupling from an interindividual distribution of intraindividual measures and that especially the measures of covariance produced accurate information on the extent of coupled responses. This methodology was applied to analyze spontaneously coupled IBI, EDA, and facial EMG responses and vagal activity in their relation to emotional experience and personality characteristics in a group of middle-aged men (n = 37) during the administration of the Rorschach testing protocol. The results revealed new characteristics in the relationship between phasic end-organ synchronization and vagal activity, on the one hand, and individual differences in emotional adjustment to novel situations on the other. Specifically, it appeared that the vagal system is intimately related to emotional and social responsivity. It was also found that the lack of spontaneously synchronized responses is related to decreased energetic arousal (e.g., depression, mood). These findings indicate that the present process analysis approach has many advantages for use in both experimental and applied research, and that it is a useful new paradigm in psychophysiological research. Keywords: Autonomic Nervous System; Emotion; Facial Electromyography; Individual Differences; Spontaneous Responses; Time Series Analysis; Vagal System

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Microarrays have a wide range of applications in the biomedical field. From the beginning, arrays have mostly been utilized in cancer research, including classification of tumors into different subgroups and identification of clinical associations. In the microarray format, a collection of small features, such as different oligonucleotides, is attached to a solid support. The advantage of microarray technology is the ability to simultaneously measure changes in the levels of multiple biomolecules. Because many diseases, including cancer, are complex, involving an interplay between various genes and environmental factors, the detection of only a single marker molecule is usually insufficient for determining disease status. Thus, a technique that simultaneously collects information on multiple molecules allows better insights into a complex disease. Since microarrays can be custom-manufactured or obtained from a number of commercial providers, understanding data quality and comparability between different platforms is important to enable the use of the technology to areas beyond basic research. When standardized, integrated array data could ultimately help to offer a complete profile of the disease, illuminating mechanisms and genes behind disorders as well as facilitating disease diagnostics. In the first part of this work, we aimed to elucidate the comparability of gene expression measurements from different oligonucleotide and cDNA microarray platforms. We compared three different gene expression microarrays; one was a commercial oligonucleotide microarray and the others commercial and custom-made cDNA microarrays. The filtered gene expression data from the commercial platforms correlated better across experiments (r=0.78-0.86) than the expression data between the custom-made and either of the two commercial platforms (r=0.62-0.76). Although the results from different platforms correlated reasonably well, combining and comparing the measurements were not straightforward. The clone errors on the custom-made array and annotation and technical differences between the platforms introduced variability in the data. In conclusion, the different gene expression microarray platforms provided results sufficiently concordant for the research setting, but the variability represents a challenge for developing diagnostic applications for the microarrays. In the second part of the work, we performed an integrated high-resolution microarray analysis of gene copy number and expression in 38 laryngeal and oral tongue squamous cell carcinoma cell lines and primary tumors. Our aim was to pinpoint genes for which expression was impacted by changes in copy number. The data revealed that especially amplifications had a clear impact on gene expression. Across the genome, 14-32% of genes in the highly amplified regions (copy number ratio >2.5) had associated overexpression. The impact of decreased copy number on gene underexpression was less clear. Using statistical analysis across the samples, we systematically identified hundreds of genes for which an increased copy number was associated with increased expression. For example, our data implied that FADD and PPFIA1 were frequently overexpressed at the 11q13 amplicon in HNSCC. The 11q13 amplicon, including known oncogenes such as CCND1 and CTTN, is well-characterized in different type of cancers, but the roles of FADD and PPFIA1 remain obscure. Taken together, the integrated microarray analysis revealed a number of known as well as novel target genes in altered regions in HNSCC. The identified genes provide a basis for functional validation and may eventually lead to the identification of novel candidates for targeted therapy in HNSCC.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The Standard Model of particle physics consists of the quantum electrodynamics (QED) and the weak and strong nuclear interactions. The QED is the basis for molecular properties, and thus it defines much of the world we see. The weak nuclear interaction is responsible for decays of nuclei, among other things, and in principle, it should also effects at the molecular scale. The strong nuclear interaction is hidden in interactions inside nuclei. From the high-energy and atomic experiments it is known that the weak interaction does not conserve parity. Consequently, the weak interaction and specifically the exchange of the Z^0 boson between a nucleon and an electron induces small energy shifts of different sign for mirror image molecules. This in turn will make the other enantiomer of a molecule energetically favorable than the other and also shifts the spectral lines of the mirror image pair of molecules into different directions creating a split. Parity violation (PV) in molecules, however, has not been observed. The topic of this thesis is how the weak interaction affects certain molecular magnetic properties, namely certain parameters of nuclear magnetic resonance (NMR) and electron spin resonance (ESR) spectroscopies. The thesis consists of numerical estimates of NMR and ESR spectral parameters and investigations of the effects of different aspects of quantum chemical computations to them. PV contributions to the NMR shielding and spin-spin coupling constants are investigated from the computational point of view. All the aspects of quantum chemical electronic structure computations are found to be very important, which makes accurate computations challenging. Effects of molecular geometry are also investigated using a model system of polysilyene chains. PV contribution to the NMR shielding constant is found to saturate after the chain reaches a certain length, but the effects of local geometry can be large. Rigorous vibrational averaging is also performed for a relatively small and rigid molecule. Vibrational corrections to the PV contribution are found to be only a couple of per cents. PV contributions to the ESR g-tensor are also evaluated using a series of molecules. Unfortunately, all the estimates are below the experimental limits, but PV in some of the heavier molecules comes close to the present day experimental resolution.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Advancements in the analysis techniques have led to a rapid accumulation of biological data in databases. Such data often are in the form of sequences of observations, examples including DNA sequences and amino acid sequences of proteins. The scale and quality of the data give promises of answering various biologically relevant questions in more detail than what has been possible before. For example, one may wish to identify areas in an amino acid sequence, which are important for the function of the corresponding protein, or investigate how characteristics on the level of DNA sequence affect the adaptation of a bacterial species to its environment. Many of the interesting questions are intimately associated with the understanding of the evolutionary relationships among the items under consideration. The aim of this work is to develop novel statistical models and computational techniques to meet with the challenge of deriving meaning from the increasing amounts of data. Our main concern is on modeling the evolutionary relationships based on the observed molecular data. We operate within a Bayesian statistical framework, which allows a probabilistic quantification of the uncertainties related to a particular solution. As the basis of our modeling approach we utilize a partition model, which is used to describe the structure of data by appropriately dividing the data items into clusters of related items. Generalizations and modifications of the partition model are developed and applied to various problems. Large-scale data sets provide also a computational challenge. The models used to describe the data must be realistic enough to capture the essential features of the current modeling task but, at the same time, simple enough to make it possible to carry out the inference in practice. The partition model fulfills these two requirements. The problem-specific features can be taken into account by modifying the prior probability distributions of the model parameters. The computational efficiency stems from the ability to integrate out the parameters of the partition model analytically, which enables the use of efficient stochastic search algorithms.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Genetics, the science of heredity and variation in living organisms, has a central role in medicine, in breeding crops and livestock, and in studying fundamental topics of biological sciences such as evolution and cell functioning. Currently the field of genetics is under a rapid development because of the recent advances in technologies by which molecular data can be obtained from living organisms. In order that most information from such data can be extracted, the analyses need to be carried out using statistical models that are tailored to take account of the particular genetic processes. In this thesis we formulate and analyze Bayesian models for genetic marker data of contemporary individuals. The major focus is on the modeling of the unobserved recent ancestry of the sampled individuals (say, for tens of generations or so), which is carried out by using explicit probabilistic reconstructions of the pedigree structures accompanied by the gene flows at the marker loci. For such a recent history, the recombination process is the major genetic force that shapes the genomes of the individuals, and it is included in the model by assuming that the recombination fractions between the adjacent markers are known. The posterior distribution of the unobserved history of the individuals is studied conditionally on the observed marker data by using a Markov chain Monte Carlo algorithm (MCMC). The example analyses consider estimation of the population structure, relatedness structure (both at the level of whole genomes as well as at each marker separately), and haplotype configurations. For situations where the pedigree structure is partially known, an algorithm to create an initial state for the MCMC algorithm is given. Furthermore, the thesis includes an extension of the model for the recent genetic history to situations where also a quantitative phenotype has been measured from the contemporary individuals. In that case the goal is to identify positions on the genome that affect the observed phenotypic values. This task is carried out within the Bayesian framework, where the number and the relative effects of the quantitative trait loci are treated as random variables whose posterior distribution is studied conditionally on the observed genetic and phenotypic data. In addition, the thesis contains an extension of a widely-used haplotyping method, the PHASE algorithm, to settings where genetic material from several individuals has been pooled together, and the allele frequencies of each pool are determined in a single genotyping.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Transposons, mobile genetic elements that are ubiquitous in all living organisms have been used as tools in molecular biology for decades. They have the ability to move into discrete DNA locations with no apparent homology to the target site. The utility of transposons as molecular tools is based on their ability to integrate into various DNA sequences efficiently, producing extensive mutant clone libraries that can be used in various molecular biology applications. Bacteriophage Mu is one of the most useful transposons due to its well-characterized and simple in vitro transposition reaction. This study establishes the properties of the Mu in vitro transposition system as a versatile multipurpose tool in molecular biology. In addition, this study describes Mu-based applications for engineering proteins by random insertional transposon mutagenesis in order to study structure-function relationships in proteins. We initially characterized the properties of the minimal Mu in vitro transposition system. We showed that the Mu transposition system works efficiently and accurately and produces insertions into a wide spectrum of target sites in different DNA molecules. Then, we developed a pentapeptide insertion mutagenesis strategy for inserting random five amino acid cassettes into proteins. These protein variants can be used especially for screening important sites for protein-protein interactions. Also, the system may produce temperature-sensitive variants of the protein of interest. Furthermore, we developed an efficient screening system for high-resolution mapping of protein-protein interfaces with the pentapeptide insertion mutagenesis. This was accomplished by combining the mutagenesis with subsequent yeast two-hybrid screening and PCR-based genetic footprinting. This combination allows the analysis of the whole mutant library en masse, without the need for producing or isolating separate mutant clones, and the protein-protein interfaces can be determined at amino acid accuracy. The system was validated by analysing the interacting region of JFC1 with Rab8A, and we show that the interaction is mediated via the JFC1 Slp homology domain. In addition, we developed a procedure for the production of nested sets of N- and C-terminal deletion variants of proteins with the Mu system. These variants are useful in many functional studies of proteins, especially in mapping regions involved in protein-protein interactions. This methodology was validated by analysing the region in yeast Mso1 involved in an interaction with Sec1. The results of this study show that the Mu in vitro transposition system is versatile for various applicational purposes and can efficiently be adapted to random protein engineering applications for functional studies of proteins.